CONSTRAINT SATISFACTION TRACTABILITY FROM SEMI-LATTICE 

OPERATIONS ON INFINITE SETS 



MANUEL BODIRSKY, H. DUGALD MACPHERSON, AND JOHAN THAPPER 

Abstract. A famous result by Jeavons, Cohen, and Gyssens shows that every constraint 
satisfaction problem (CSP) where the constraints are preserved by a semi-lattice operation 
can be solved in polynomial time. This is one of the basic facts for the so-called universal- 
algebraic approach to a systematic theory of tractability and hardness in finite domain con- 
straint satisfaction. 

Not surprisingly, the theorem of Jeavons et al. fails for arbitrary infinite domain CSPs. 
Many CSPs of practical interest, though, and in particular those CSPs that are motivated by 
qualitative reasoning calculi from Artificial Intelligence, can be formulated with constraint 
languages that are rather well-behaved from a model-theoretic point of view. In particular, 
the automorphism group of these constraint languages tends to be large in the sense that 
the number of orbits of n-subsets of the automorphism group is bounded by some function 
in n. 

In this paper we present a generalization of the theorem by Jeavons et al. to infinite 
domain CSPs where the number of orbits of n-subsets grows sub-exponentially in n, and 
prove that preservation under a semi-lattice operation for such CSPs implies polynomial- 
time tractability. Unlike the result of Jeavons et al., this includes many CSPs that cannot 
be solved by Datalog. 



1. Introduction 

Constraint satisfaction problems are fundamental computational problems that arise in 
many areas of theoretical computer science. In recent years, a considerable amount of research 
has been concentrated on the classification of those constraint satisfaction problems that can 
be solved in polynomial time, and those that are computationally hard. In this paper, we 
contribute to this line of research and generalize an important tractability condition from 
finite domain constraint satisfaction to a broad class of infinite domain constraint satisfaction 
problems. 

We work with the following definition of constraint satisfaction problems (CSPs), which is 
well-adapted to treat CSPs over infinite as well as finite domains. The definition is based on 
the concept of a homomorphism between relational structures, and equivalent to the standard 
definition for finite domain CSPs. A (relational) structure A consists of a (not necessarily 
finite) domain D{A) (or simply A when no confusion can arise), and a set of relations on 
D(A), each of a finite positive arity. Each relation is named by a relation symbol R; the 
corresponding relation in A is denoted by R A . The set of all relation symbols is called the 
signature of the structure. A homomorphism from a relational structure A to a relational 
structure B over the same signature is a mapping / : D{A) — > D(B) such that for each 
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relation symbol R, and tuple t G R , it holds that /(f) G where / is applied component- 
wise to t. The existence of a homomorphism from A to B is denoted by ^4 — > B. For a 
fixed structure, traditionally denoted by T, with finite relational signature r the constraint 
satisfaction problem for T (denoted by CSP(r)) is the following problem. 

csp(r) 

INSTANCE: A finite structure A over the signature r. 
QUESTION: Is there a homomorphism from A to T? 

To give an example, the three-colorability problem can be formulated as CSP(1\3), where 
K3 is the complete graph with three elements. A basic example of an infinite-domain CSP is 
CSP((Q; <)), where (Q; <) is the strict linear ordering of the rationals. 

Jeavons and co-authors [8, 18J made the ground-breaking observation that for finite struc- 
tures T, the complexity of CSP(r) is captured by the polymorphisms of T, defined as follows. 
When / : D k — > D is a fc-ary function, and R is an an ra-ary relation over D, then we say 
that / preserves R if for all n-tuples t±, . . . , t k G R, we have 



A polymorphism of a relational structure T with domain D = D(T) is a function from D k to 
D that preserves all relations of T. In other words, a polymorphism is a homomorphism from 
T k to r, for some k. The exploitation of polymorphisms for classifying the complexity of CSPs 
is sometimes referred to as the universal-algebraic approach. Indeed, very often tractability 
of CSP(r) is linked to polymorphisms of T with certain 'good properties'. 

For a finite domain D = D(T), it is known that CSP(r) is NP-hard unless T has a poly- 
morphism / : D k —7- D satisfying 




for all xi, . . . , xu G D, and it is conjectured that CSP(r) can be solved in polynomial-time 
whenever T has such a polymorphism [10 This conjecture is known to hold in several special 
cases when / satisfies stronger identities. We will now look at one such case. 
An operation / : D 2 — > D is called 

• idempotent if f(x,x) = x; 

• commutative if f(x,y) = f(y,x) for all x,y G D; 

• associative if f(x, f(y, z)) = f(f(x, y), z) for all x,y,z G D; 

• a semi-lattice operation if it is commutative, associative, and idempotent. 

Jeavons, Cohen, and Gyssens [18] proved that for every finite structure T with a polymor- 
phism that is a semi-lattice operation, the problem CSP(r) can be solved in polynomial time. 
In this paper, we present a generalization of this result to a large class of infinite-domain 
CSPs. All infinite structures considered are assumed to be countably infinite. To state the 
result, we need the following definitions. 

A bijective homomorphism with an inverse that is also a homomorphism is called an iso- 
morphism. An automorphism of a relational structure T is an isomorphism between T and 
itself, and the set of all automorphisms of T is denoted by Aut(r). For a subset S of the 

^Barto and Kozik [l] call / a cyclic term when it satisfies the additional requirement of being idempotent; 
f(x, . . . , x) = x for all x G D. They state the conditions for NP-hardness and the conjecture for polynomial- 
time tractability in terms of the absence or presence of such cyclic terms among the polymorphisms of the core 
o/r, cf. Section T3. 2 1 It is not hard to verify that their condition is equivalent to the one given here. 



(/(ti[l],. ..,**[!]),. ..,/(ti[n],...,tfcH)) eR. 




f{x 1 ,x 2 , ...,x k ) = f(x 2 ,x 3 , • • • , x k , Xl), 
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domain of T, the orbit of S in T is the set {{a(s) \ s G S} \ a G Aut(r)}. When S is of 
cardinality n, then we call the orbit of S in T an orbit of n-subsets. If the number of orbits of 
n-subsets of T is at least c n for some c > 1 and all sufficiently large n, then we say that T has 
exponential growth. Otherwise, we say that T has sub-exponential growth, or that T is a sub- 
exponential structure. Note that every finite structure is a sub-exponential structure since for 
n greater than the domain size, there are no n-subsets at all, hence zero orbits. But also the 
structure (Q; <) (and all structures with domain Q whose relations are first-order definable 
in (Q; <)) is sub-exponential: it has only one orbit of n-subsets, for all n (see e.g. p7]). Our 
main result is the following. 

Theorem 1.1. LetT be a sub- exponential structure with finite relational signature. IfT has 
a semi-lattice polymorphism, then CSP(r) can be be solved in polynomial time. 

Finite domain structures with a semi-lattice polymorphism can be solved by a standard 
technique, known as establishing arc-consistency [H]. The situation is different for sub- 
exponential structures with a semi-lattice polymorphism. Consider for instance the structure 
(Q; {(x, y, z) \ x > y V x > z}). It has the same automorphism group as (Q; <) and hence 
is sub-exponential as well. This structure has the function (x,y) \— >■ min(x,y) on Q as a 
polymorphism, but it has been shown that there is no k such that this problem can be solved 
by establishing ^-consistency [5j. Tractability of CSP((Q; {(x, y, z) \ x > y V x > z})) instead 
follows from the following more general result of [4]: every structure T with domain Q that 
has the same automorphism group as (Q; <) and that is preserved by the minimum function 
(or the maximum function) has a polynomial-time tractable CSP. Our result will be a proper 
generalization of this result and of the mentioned result of Jeavons, Cohen, and Gyssens. 

Let us remark that for a general infinite structure, a semi-lattice polymorphism does 
not suffice to ensure tractability. For an arbitrary subset U of N, let Tjj be the structure 
(N; {{x,y) G N 2 | x = y + 1}, {0}, U). Every such structure has min (and max) as a semi- 
lattice polymorphism. We claim that CSP(F[/) and CSP(ry) are different problems for 
distinct subsets U and V of N. Let m be any element in U AV, where A denotes the sym- 
metric difference of the sets, and let A m be the instance on variables {xq, . . . , x m } containing 
the constraints xq = 0, Xi = + 1 for 1 < i < m, and U(x m ). Then A m is a satisfiable 
instance for precisely one of the problems CSP(F(/) and CSP(ry). It follows that there are 
as many pairwise distinct CSPs of this type as there are subsets of N, i.e., uncountably many. 
However, there are no more than countably many algorithms, hence CSP(r;y) is undecidable 
for some U . 

The example of the previous paragraph relies on the fact that for all structures Tjj, there is 
an infinite number of orbits of 2-subsets. When the number of orbits of n-subsets of T is finite 
for all n, then the structure is called uj- categorical in model theory [17]. For w-categorical 
structures, polymorphisms still capture the computational complexity of CSP(r) [6]. Our 
main result can thus be seen as a contribution to the further extension of the universal- 
algebraic approach from finite to cj-categorical structures. 

Overview. Our paper is structured as follows. In Section [2] we introduce a new algorithmic 
technique to solve infinite-domain constraint satisfaction problems, and present a reduction of 
CSP(r) for structures T with a semi-lattice polymorphism to an efficient sampling algorithm 
for r. The basic idea is that when there is such an efficient sampling algorithm for T, then we 
can use the arc-consistency procedure for finite domain CSPs to solve CSP(r) (we actually use 
the uniform version of the arc-consistency procedure where both A and a finite template B 
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are part of the input). In fact, this technique works under a slightly more general assumption 
on T: instead of requiring the existence of a semi-lattice polymorphism, we only require that 
r has totally symmetric polymorphisms of all arities. 

The next part of our paper, Section O is devoted to the proof that all sub-exponential 
structures T with totally symmetric polymorphisms of all arities admit such an efficient sam- 
pling algorithm. Here, our proof is based on a classification of those structures T. We would 
like to remark that the general algorithmic technique is applicable also for many structures 
with totally symmetric polymorphisms of all arities that are not sub-exponential, and this will 
be illustrated by some examples in Section [U In fact, we make the conjecture that when T is 
an w-categorical structure with a semi-lattice polymorphism, then CSP(T) is in P. However, 
unlike the case of sub-exponential structures, we cannot provide a classification result like the 
one in Section [3] for this more general case, and so this remains an interesting open question. 

2. Algorithm 

One of the basic building blocks of our algorithm will be the arc-consistency procedure. 
We start this section by recalling, in the case of finite relational structures, the connection 
between the applicability of this procedure, homomorphisms from the set structure, and the 
existence of totally symmetric polymorphisms. 

Let B be a finite structure with a finite relational signature, and let A be an instance 
of CSP(-B). The arc- consistency procedure (AC) applied to the problem (A, B) works by 
reducing a set of possible images for each variable in A. If such a set becomes empty during 
the procedure, it follows that there can be no homomorphism, so AC rejects. Otherwise, 
AC accepts. An algorithm for AC is shown in Figure [TJ We say that arc-consistency solves 
the problem CSP(S) if, for every instance A, the procedure accepts if and only if A — > B. 
Arc-consistency is sometimes called hyperarc- consistency when applied to structures with 
relations of arity greater than two. It can be implemented to run in time that is polynomial 
in \A\ + \B\, take e.g. AC-3 [U]. 

The set structure of B, denoted by V(B), has as vertices all non-empty subsets of the 
domain of B. For every k-avy relation R , we have (U!,...,U k ) € R V[B) iff for every i 
and Ui £ f/j, there exists u\ € U%,...,Ui-i € € Oi+i, • • • , Uk € Uk such that 

(«!,..., « fe ) <E R B . 

A k-aiy function / : D k — > D is called totally symmetric if for all x\, . . . , x k , yi, ■ ■ • , yu €E D 
we have 

f(x 1 , . . . ,x k ) = f(yt, ...,yk) whenever {xi, ...,x k } = {y x , . . . ,y k }. 

We say that a structure B has totally symmetric polymorphisms of all arities if, for each 
k > 1, there is a fe-ary polymorphism of T that is totally symmetric. 
The following is well-known, cf. |13yi5j. 

Theorem 2.1. Let B be a finite structure with a finite relational signature. The following 
are equivalent. 

(1) The arc- consistency procedure solves CSP(-B). 

(2) There is a homomorphism V(B) — > B. 

(3) The structure B has totally symmetric polymorphisms of all arities. 

It is clear that when T has a semi-lattice operation /, then it also has a totally symmetric 
polymorphism f n of arity n, for each n > 2: 

f n (xi, ...,x n ):= f(xt, f(x 2 , . . • , f(x n -i,x n ) ...)). 
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AC(A, B) 

II Input: Finite relational structures A and B over the same signature 
// Accepts iff A-» V{B) 
for all x e D{A) do 

h(x) := D(B) 
repeat 

for all R A G A do 

for all (#1, . . . , Xk) € i? A do 
for all i = 1 , . . . , k do 

/i(xi) := Tti(R B H x • ■ • x h(xk)) 

until /i(x) does not change for any a; S -D(-A) 
if h(x) — for some x € D(A) then reject 
else accept 



Figure 1. Algorithm for AC(A, B), closely following the pseudocode given in |12| . 

From Theorem 12, H it thus follows that arc-consistency solves CSP(-B) whenever B is a finite 
relational structure with a semi-lattice polymorphism. In our arguments, we only need the 
weaker condition on totally symmetric polymorphisms of all arities; this gives us a stronger 
result. 

The other component of our algorithm will be a procedure to efficiently "sample" appropri- 
ate finite substructures of an infinite structure T. Formally, we make the following definition. 

Definition 2.1. Let V be a structure over a finite relational signature. We say that an 
algorithm is a sampling algorithm for V if, given a positive integer n, it computes a finite 
structure B that is isomorphic to a substructure ofT such that A — > B if and only if A — > T, 
for every instance A with \A\ =n. A sampling algorithm is called efficient if its running time 
is bounded by a polynomial in n. 

We are now ready to give an outline of our algorithm. Let T be a sub-exponential structure 
over a finite relational signature, and assume that T has totally symmetric polymorphisms of 
all arities. 



csp(r) 

/ / Input: A finite relational structure A over the same signature as T 

II Accepts iff A -> T 

B := Sample-r(|A|). 

if AC(A,B) rejects then reject 

else accept 



Figure 2. Algorithm for CSP(r), with T being a sub-exponential structure 
with totally symmetric polymorphisms of all arities. 'Sample-r' is a sampling 
algorithm for T. 

The main idea of our algorithm is to reduce CSP(r) to an appropriate uniform finite domain 
CSP. That is, when given an instance A of CSP(r), we reduce to the following problem: decide 
whether A maps homomorphically to B, where B is a finite substructure of T returned by 
a sampling algorithm for T on input \A\, and B is considered as part of the input. The 
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algorithm is given in Figure [2j Note that we rely on arc-consistency for deciding whether 
A maps homomorphically to B. Hence, for this approach to work, we need to establish the 
following. 

(1) There should be an efficient sampling algorithm which samples some B from V. 

(2) The arc-consistency procedure applied to (A, B) should accept if and only if A B. 

The first condition implies that the size of B is polynomial in the size of A, and since AC can 
be implemented to run in time that is polynomial in \A\ + \B\, it follows that our algorithm 
will be polynomial in \A\. The second condition ensures that the algorithm gives the correct 
answer for every instance A of CSP(T). 

It will be the purpose of Section[3]to prove that an efficient sampling algorithm for T exists. 
We state this result as follows. 

Theorem 2.2. Let T be a sub- exponential structure with a finite relational signature and 
totally symmetric polymorphisms of all arities. Then there is an efficient sampling algorithm 
for r. 

We next prove one part of a generalization of the equivalence between the second and 
third item of Theorem 12. II to infinite domains. This result has a converse for all w-categorical 
structures, cf. Section [5j 

Lemma 2.3. Let T be a structure over a finite relational signature. IfT has totally symmetric 
polymorphisms of all arities, then V(S) — > T for all finite substructures S C T. 

Proof. Let S be a finite substructure of T, and let / be an m-ary totally symmetric poly- 
morphism of r, where m = k\S\ and k is the maximum arity of any relation in T. Let 
/' : V(S) — >■ T be the function defined on a non-empty set X = {x\ , . . . , x^} C S by 
f'(X) = f(xi, . . . , Xi, Xi, . . . , Xi), where the list of arguments of / is padded to length m by 
elements already occurring in X. We claim that /' is a homomorphism V(S) — > T. We must 
show that (f'{U\), . . . , f'(Uk)) € R r for an arbitrary relation R, and tuples (U\, . . . ,11^) € 
rP(S)_ For each i < i < k, let Ti Q R s (1 (U! x ■■■ x U k ) be a set of /c-tuples such that 
{t[i\ | t £ Ti] = Ui. Note that {t[i] \ t € Tj, 1 < j < k} = Ui, and that we can choose Tj 
to have size l(i) := \Ui\ < \S\, for each i. Let un, . . . ,Uuu\ be an enumeration of Ui and 
t\, . . . , be an enumeration of Tj. We then have 

f\Ui) = f'{{un ,...u im }) = f(t\[i],..., t} {l) [i\, . . . ,t\[i], . . . , t k l[k) [i] , t k l{k) [i] , . . . , t k m [i] ) , 

for all 1 < i < k. Since / is a polymorphism of V, and t l - 6 R s C R r for all i and j, it follows 
that (/'(E/i), . . . , f'(U k )) eR r . □ 

The correctness and efficiency of our algorithm now follows from the previous lemma in 
conjunction with the existence of an efficient sampling algorithm for T. 

Theorem 2.4. Let T be a structure over a finite relational signature with totally symmetric 
polymorphisms of all arities. If there exists an efficient sampling algorithm for T, then the 
algorithm in Figure® correctly solves CSP(T) in polynomial time. 

Proof. Let A be the input structure and let B be the structure returned by the sampling 
algorithm for V on input \A\. The sampling algorithm runs in polynomial time in \A\, so 
the size of B will be polynomial in \A\. Since AC(A,B) can be implemented to run in time 
polynomial in \A\ + \B\, it follows that the entire algorithm runs in polynomial time. 
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To show correctness, note that if AC(A,B) rejects, then A -fr B which is equivalent to 
A t^> r since B was produced by the sampling algorithm for V on input \A\. We may 
therefore safely reject. Otherwise, KC{A,B) terminates with a non-empty list h(x) C B 
for each x £ A. Furthermore, for each /c-ary relation R A , tuple (x±, . . . ,Xk) € R A , index i, 
and element d € h(xi), we know that R B n h{x\) x • • • x h{xk) contains a tuple (di, . . . , dfc) 
with dj = d. In other words . . . , h(xk)) G R V ^ B \ so the function h : A ^ V(B) is 

a homomorphism. By assumption, T has totally symmetric polymorphisms of all arities, so 
Lemma 12.31 implies the existence of a homomorphism g : V(B) — > T. In conclusion, we have 
a homomorphism g o h : A — > T. □ 

As a direct corollary of Theorem 12.21 and Theorem 12.41 we get the following result, which 
also implies our main result, Theorem ll.il 

Theorem 2.5. Let T be a sub- exponential structure with a finite relational signature. IfT has 
totally symmetric polymorphisms of all arities, then CSP(r) is solvable in polynomial time. 

3. Classification 

The purpose of this section is to prove Theorem 12.21 showing that every sub-exponential 
structure has an efficient sampling algorithm. Our approach is based on a classification of 
sub-exponential structures with totally symmetric polymorphisms of all arities. The gen- 
eral outline of this classification is as follows. We first present an argument that reduces 
the classification task to those sub-exponential structures T that are model- complete cores 
and have totally symmetric polymorphisms of all arities (Section I3.2[) . From there on, the 
classification follows a decomposition of the automorphism group of T. The next step is the 
reduction to those structures T having a transitive automorphism group (Section I3.3[) . We 
then use the fact that the automorphism group of T has only finitely many congruence re- 
lations to further reduce the classification task to the case that the automorphism group of 
r is primitive (Section 13. 4p . Combining the central theorem from [20] on primitive permu- 
tation groups with a sub-exponential number of orbits of n-subsets with Cameron's theorem 
on highly set-transitive permutation groups [9j we finish the classification in Section 13.51 

We find it instructive to give a 'top-down' presentation of the classification proof, rather 
than starting from special cases, and to assemble more general sub-exponential structures from 
specific ones. We thus take a decomposition approach, and show first how to describe the 
most general case in terms of its components. This sometimes leads to forward-references of 
results, but we believe that the reader will be compensated by a more accessible presentation. 

3.1. Preliminaries. Before we start, we recall a basic fact which will frequently be used 
in the following arguments, and which explains the interaction between permutation group 
theory and logic for w-categorical structures. Let T be a relational structure. In this paper, we 
say that a relation R over the domain of T is first-order definable in T if there is a first-order 
formula <j){x) such that (j)(d) is true in (r, a) if and only if a € R. We say that a relational 
structure V is first-order definable in T if V and T have the same domain and every relation 
R in r' is first-order definable in T. We say that two relational structures are first- order 
interdefinable if one of them is first-order definable in the other, and vice versa. 

Theorem 3.1 (see e.g. Theorem 6.3.5 in [17]). Let T be co-categorical. Then a relation R 
is preserved by all automorphisms of T if and only if R has a first-order definition in T. 
In particular, two structures are first-order interdefinable if and only if they have the same 
automorphism group. 
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This theorem makes possible a translation of terminology between logic and permutation 
groups. We illustrate its use with the following, which will be needed later on. A congruence 
of a permutation group is an equivalence relation that is preserved by all permutations in the 
group. A permutation group is called primitive if the only congruences are the equivalence 
relation with just one equivalence class, and the equivalence relation where all equivalence 
classes are of size one; it is called imprimitive otherwise. By Theorem 13. 1\ the congruences of 
the automorphism group of an w-categorical structure T are precisely the first-order definable 
equivalence relations of T. We will say that T is primitive if its automorphism group is 
primitive. 

A permutation group G on a countable set X is called closed if and only if it is the 
automorphism group of a relational structure with domain X. The topological explanations 
for this terminology can be found in |16j . Theorem 2.4.4. The corresponding topology is 
called the topology of pointwise convergence on Sym(X), where Sym(X) denotes the set 
of all permutations of X. In this topology, the open sets are unions of sets of the form 
{a £ Sym(X) | a(x) = y}, for n-tuples x, y of elements of X. A subset H of G C Sym(A) is 
dense (in G) if the closure of H with respect to this topology equals G. 

3.2. Model-Complete Cores. An endomorphism of a relational structure T is a homomor- 
phism from T to V; we denote the set of all endomorphisms of T by End(r). A relational 
structure is called a core if every endomorphism of T is an embedding^. For a relational 
structure A, a core of A is a core structure T that is homomorphically equivalent to A, that 
is, there is a homomorphism from T to A and vice versa. A first-order formula is called 
primitive positive if it is of the form 

3xi, • • -,x n (ipi A • • • A Vn), 

where ip\, . . . , ip n are atomic formulas. The importance of primitive positive definitions in this 
paper comes from the fact that relations with a primitive positive definition in a relational 
structure V are preserved by the polymorphisms of V. 

The motivation of these definitions for constraint satisfaction with finite templates A comes 
from the following facts. 

• Every finite relational structure A has a core T, and T is unique up to isomorphism. 

• When r is a core of A, then T and A have the same CSP. 

• In a finite core structure V, every orbit of n-tuple^l is primitive positive definable. 
These properties have been generalized to w-categorical structures. To precisely state the 

result, we need the following concepts. A structure T is model- complete if every embedding 
of r into itself preserves all first-order formulas. We later need the following. 

Lemma 3.2 (Combination of Theorem 18 in [2] and proof of Corollary 7 in [7J). An uj- 

categorical relational structure A is a model- complete core if and only if the group of auto- 
morphisms of A is dense in the endomorphism monoid of A, that is, for every endomorphism 
f and finite subset U of D(A), there is an automorphism g of A agreeing with f on U. 

Theorem 3.3 (Theorem 16 in [2]; see also [3]). Let A be an oj- categorical relational structure. 
Then 

2 An embedding of a relational structure V into a relational structure A is an isomorphism between F and 
an induced substructure of A. 

^When (ti, . . . , t n ) is an n-tuple of elements of F, then the orbit oft (in V) is the set {(a(ti), . . . , a(t n )) \ a G 
Aut(F)}. 
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(1) A is homomorphically equivalent to a model- complete core V; 

(2) the structure V is unique up to isomorphism, and uj -categorical or finite; 

(3) in r, every orbit of n-tuples is primitive positive definable. 

For our classification project (and our algorithmic result), it therefore suffices to study 
the CSPs for model-complete cores of sub-exponential structures. Let us first show that the 
model-complete core of a sub-exponential structure is again sub-exponential. This follows 
from the following more general result. 

Proposition 3.4. Let A be an oj- categorical relational structure, and let V be its model- 
complete core. Then for every n, the number of orbits of n-subsets in V is at most the number 
of orbits of n-subsets in A. 

Proof. Let / be a homomorphism from A to T, and g be a homomorphism from T to A. Since 
r is a core, it follows that / o g is an embedding, so g is injective. It now suffices to show 
that when two n-subsets t±,t2 of T are mapped by g to two n-subsets si, S2 in the same orbit 
of n-subsets in A, then t\ and t% lie in the same orbit of n-subsets in T. Let ti be an n-tuple 
listing all the elements in t±, let a be an automorphism of A that maps si to S2, and let 
^2 = ct{g{i\)). Since s~2 lists all the elements of S2, we can arrange the elements of t2 into an 
n-tuple t,2 such that 5(^2) = ^2. By Theorem I3.3li3j) . there are primitive positive definitions 
4>\ and 4>2 of the orbits of t\ and £2- Since g, a, and / preserve primitive positive formulas, 
the tuple £3 := f(a(g(ti))) satisfies (j>\. But f(a(g(ti))) = f(g(t2)), and hence £3 also satisfies 
4>2- Therefore, <pi and ^2 define the same orbit of n-tuples, and so i\ and £2 are in the same 
orbit. This implies that t\ and £2 are in the same orbit of n-subsets. □ 

In general it might not be true that the model-complete core of a sub-exponential structure 
with a semi-lattice polymorphism has again a semi-lattice polymorphism. A finite example 
of this situation can be derived from Proposition 5.2 in [19]. This example shows a finite 
poset with a semi-lattice polymorphism which retracts to a poset without a semi-lattice 
polymorphism. By introducing constants, the latter structure can be turned into a core of 
the former. However, we always have the following. 

Proposition 3.5. Let A be an u- categorical relational structure with an n-ary totally sym- 
metric polymorphism. Then the model- complete core of A also has an n-ary totally symmetric 
polymorphism. 

Proof. Let T be the model-complete core of A, and let g : A — > T and h : T — > A be homomor- 
phisms. When / is an n-ary totally symmetric polymorphism of A, then /' : D(T) n — > D(T) 
defined by (xi, . . . ,x n ) 1— s- g(f(h(xi), . . . , h(x n ))) is totally symmetric, and a polymorphism 
of r. □ 

Once we have a sampling algorithm for the core T of a structure A, we obtain a sampling 
algorithm for A as follows. Let h : T — > A be a homomorphism and let B' be a sample 
returned by the sampling algorithm for V on input n. We then let the sample B of A be the 
substructure induced by the image of B' under h. For any structure A of size n it follows 
that A — > A implies A — > T which implies A — )• B' — >• B, so B is indeed a sample of A. The 
structure B can be computed in polynomial time in the size of B' which in turn is polynomial 
in n, so B can be computed in polynomial time in n. We conclude that it suffices to show 
Theorem 12 .21 for the special case of sub-exponential structures that are model-complete cores 
and have totally symmetric polymorphisms of all arities. 
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3.3. Reduction to the Transitive Case. Let T be a sub-exponential model-complete core. 
Since T is sub-exponential, it has in particular a finite number of orbits of 1-subsets, called 
orbits for short. A structure is called transitive if it has only one orbit. 

Proposition 3.6. Let T be an u>- categorical model- complete core, and let T' be the expansion 
of r by all primitive positive definable relations. Let U be an orbit of T, and A' be the 
restriction ofV to U. Then A' is a transitive model- complete core. 

Proof. First observe that every automorphism a of T is also an automorphism of T' and pre- 
serves U, and hence a\jj is an endomorphism of A'. Since the same also applies to the inverse 
of a, we have that also a\jj has an inverse in End(A'), and therefore is an automorphism of 
A'. So, the restriction of an automorphism of T to U is an automorphism of A', and therefore 
A' is transitive. 

To show that A' is a model-complete core, let e be an endomorphism of A', and let t 
be a fc-tuple of elements from U . Then any primitive positive formula that holds on t in V 
also holds on e(t), since A' is the restriction of an expansion of T by all primitive positive 
definable relations. Since T is a model-complete core, the orbits of fc-tuples are primitive 
positive definable in T by Theorem 13.3( 3). and hence there is an automorphism a of T that 
maps e to e(t). Then a\u is an automorphism of A'. This shows that Aut(A') is dense in 
End(A'), and the statement follows from Lemma 13.21 □ 

The following result is proved in Section 13.41 and Section 13.51 

Theorem 3.7. Let A be a transitive sub- exponential model- complete core with totally sym- 
metric polymorphisms of all arities. Then A is either a structure of size 1, or it is isomorphic 
to a structure which is first-order interdefinable with (Q; <). Any such structure with a finite 
relational signature has an efficient sampling procedure. 

So we are left with the task to design an efficient sampling algorithm for T using the efficient 
sampling algorithms for each substructure of T induced by an orbit of T. For this, we need 
to analyze how the automorphism group G of T is built from its transitive constituents, that 
is, from the permutation groups of the form {a\u \ a £ Aut(T)} on U where U is an orbit of 
T. In general, we only know that G is a subdirect product of its transitive constituents (see, 
e.g., [Ll])- In our case, we can make this decomposition more precise, since we have a good 
knowledge of the group Aut((Q; <)). 

Lemma 3.8. Let V be a sub- exponential structure with an Aut(T) -invariant linear order < 
defined on the union of two orbits U and V. Then U and V are convex with respect to <. 

Proof. Assume to the contrary that there are elements u\,u-i E U, v\ 6 V such that u\ < 
v\ < U2- Since u\ and U2 lie in the same orbit, there is an automorphism a of T such that 
a{u\) = U2- Let Vi + \ = a(vi) for i = 1, . . . , m — 1, and let Uj + i = a(ui) for i = 2, . . . ,<m — 1. 
Since a preserves the order <, we have «2 = a(ui) < a{v\) = V2, and t>2 < a(u2) = u^. 
By repeated application of a, we obtain -Uj < vi < u^+i for all i = 1, ... ,m — 1, and finally 
Um < Vm- Hence, we can encode sequences s G {0, l} m in subsets S C T of size m by 
letting Ui £ S iff s« = and V{ € S iff S{ = 1. Different sequences of length m then 
correspond to different orbits of m-subsets in T. This contradicts the assumption that T is 
sub-exponential. □ 

We now describe the automorphism groups of sub-exponential structures where the tran- 
sitive constituents G\, . . . , Gk for all orbits U\, . . . ,Uk of G are isomorphic to Aut((Q; <)): 
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the following theorem shows that in this case G is precisely what Cameron [TO] calls the 
intransitive action of the direct product G\ x • • • x Gk on U\ U ■ ■ ■ U Uf.. We will say that a 
permutation group G C Sym(X) is transitive on a subset Y C X if for all x,y G Y, there 
exists a permutation a G G such that a(x) = y. Otherwise, we say that G is intransitive on 
Y. 

Theorem 3.9. Let T be a sub- exponential structure with automorphism group G and orbits 
U\, . . . , Uk, and for 1 < i < k let Gi be the transitive constituent of G on f/j. Assume that Gi 
is isomorphic to Aut((Q; <)) for all 1 <i < k. Then, a G Sym(D) is an automorphism ofT 
if and only if a\i/. G Gi for each 1 < % < k. 

Proof. Since Gi is isomorphic to Aut((Q; <)) for each i, there is by Theorem 13. II a first-order 
definable dense linear order on each orbit C/j. We will use < to collectively denote these 
orders, and rely on the context to determine which orbit it applies to. 

The result is trivial for k = 1, so assume that k > 1. Let G' be the intransitive action 
of G\ x • • • x Gk on U\ U • • • U Uk- It suffices to show that G is dense in G'; since G is 
closed, they must then be equal. Assume for the sake of contradiction that G is not dense 
in G' . Then, there are finite sequences (u,u') and (v,v') in T, and an automorphism a G G' 
such that a(u,u') = (v,v'), but j(u,u') ^ (v,v') for all 7 G G. Furthermore we can choose 
(u, u') and (v,v') so that (3(u) = v for some j3 G G. By applying /3 _1 to v and a, we may 
then assume that v = u and a(u) = u. By a u-interval, we will mean an inclusion-maximal 
convex subset of U{ \ u, for some i, where convexity is evaluated with respect to C/j. Now 
u' lies in some u-interval I, and since v' = a(u'), we have v' G / as well. By assumption 
we have ~f(u') 7^ for all 7 G Aut((T,n)), so Aut((r,u)) is intransitive on /. On the other 
hand, if we let w := uPiU, where U is the orbit of T containing /, then Aut((T, w)) is clearly 
transitive on each ?D-interval contained in U: each u)-interval contained in U is an orbit, 
and its corresponding transitive constituent of Aut((T,u;)) is isomorphic to Aut((Q; <)). In 
particular, Aut((T, is transitive on /. It therefore follows that we can find a subsequence 
a of u containing w, and an element b U such that Aut((T,a)) is transitive on / but 
Aut((r, a, b)) is not. 

By Lemma 13. 8^ we have that every orbit of (T, a, b) contained in / is convex. Since 
Aut((r,a,6)) is intransitive on /, it follows that there is an initial segment 1(b) C / de- 
finable in (T, a, b). Let x G 1(b), y G I \ 1(b), and pick an automorphism a G Aut((T,a)) 
such that a(x) = y. Now a(I(b)) 5 1(b) and a(I(b)) is definable in (T,a,a(b)). Let b\ = b 
and for i > 1, let b, L+ i = a(bi). By repeating this procedure, we get an increasing sequence 
of sets 1(h) C I(b 2 ) C • • • C I(b m ) which are all definable in (T, a,b), where b — (bi, • • • , b m ). 
For i > 1, pick q G I(bi + i) \ I(bi), so that each of the elements q lies in a different orbit of 
(r, a, b). We now encode a set S C {1, . . . , m} as the subset T of T consisting of the elements 
in a, b, and {q | i G S}. Let S" C {1, . . . , m) be another set with encoding T' , and let a G G 
be an automorphism such that a(T) = T' . Then a has to fix a and b, i.e., a must be an 
automorphism of (T,a,b). Therefore a cannot map to Cj for z 7^ j, so S 1 must be equal 
to S' . The set T has size at most 2m + |a|, and since we can fix the size of o, we conclude 
that the number of distinct orbits of m-subsets of T is at least f2(2 m / 2 ). This contradicts the 
sub-exponentiality of T, so G must be dense in G' , and the result follows. □ 

Together with the remarks of Section 13. 2\ the following implies Theorem 12.21 

Theorem 3.10. Every sub- exponential model- complete core V with totally symmetric poly- 
morphisms of all arities and a finite relational signature r has an efficient sampling algorithm. 
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Proof. For a given n, we have to compute a r-structure B in polynomial time in n such that for 
all structures A of size n we have A — > V if and only if A — > B. Let U\ , . . . , £/& be the orbits of T. 
Let r' be the expansion of T by all primitive positive definable relations, and let A' 1; . . . , A' k be 
the structures induced in T' by the orbits U\, . . . , £/& of F. By Proposition l3.6|, A^ is a transitive 
model-complete core, for all i < k. Since A^ also has totally symmetric polymorphisms of 
all arities (obtained as the restrictions of the totally symmetric polymorphisms of T to t/j), 
we can apply Theorem 13.71 and conclude that each of the structures A£ either has size 1, 
or is isomorphic to a structure which is first-order interdefinable with (Q; <). We will now 
prove that an efficient sampling procedure exists in the case when each of the structures A • is 
isomorphic to a structure which is first-order interdefinable with (Q; <). The proof can easily 
be modified to handle the case when some of the structures A^ have size 1. 

Let m be the maximal arity of r, and let a be the signature that contains a relation symbol 
for each at most m-ary primitive positive definable relation in r. For all % < k, let Aj be 
the structure obtained from A^ by removing all relations except the relations for the symbols 
from a. Then Aj has a finite signature, and we conclude by Theorem 13,71 that Aj has an 
efficient sampling procedure. 

Let Bi be the a-structure produced by this sampler for Aj on input n; we can assume that 
it has exactly n vertices u\,...,u l n . The output of our algorithm will be the r-reduct of a 
a-structure B with vertex set {u l j | 1 < j < n, 1 < i < k}. Since the orbits are primitive 
positive definable in T, there is a unary relation symbol R(U) in a for each orbit U of T. The 
structure B will be such that € R(Ui), for all i < k and j < n. Note that by Theorem 13. 9| 
all such structures B will be isomorphic. 

For R E t, we now add tuples to the relation R B as follows. We first fix a partition P of 
the arguments of R into at most k parts. Let p\, . . . ,p}^ be the arguments of R of the i-th 
part. For each i the relation 

{(t[p\], . . . , t\pj ii} }) | t € R and t\p>] G Uj for all j < k, I < 

is primitive positive definable in T, and there is a relation symbol R(P, i) for this relation 
in a. It is clear that given P and the relations R(P, i) of Bi we can efficiently compute the 
relation 

R(P) = {t | (t\p{], . . e R(P,i) for all i < k} . 

It follows from Theorem 13.91 that R equals the union of R(P) over all partitions P of the 
arguments of R - and since there is a constant number of such partitions, R can be computed 
in polynomial time. □ 

We are left with the task to prove Theorem 13.71 

3.4. The Transitive Case is Primitive. We now proceed to show that when T is a transi- 
tive sub-exponential model-complete core with totally symmetric polymorphisms of all arities, 
then it is also primitive. Combining this result with the following two theorems shows that 
r is isomorphic to a structure with a first-order definition in (Q; <). This allows us to finish 
the proof of Theorem 13.71 in the next section. 

Theorem 3.11 (cf. |20j). Let G be a primitive but not highly set-transitive permutation group 
on an infinite set X. If c is a real number with 1 < c < 2 1 / 5 , then G has more than c n orbits 
of n-subsets of X, for all sufficiently large n. 
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Theorem 3.12 (Cameron [9]). A permutation group G on an infinite set is highly set- 
transitive iff it is isomorphic to the automorphism group of a structure with a first- order 
definition in (Q; <). 

We will need two more lemmas. Let it! and S be two binary relations. An alternating closed 
walk on R and S of length 2n is a sequence of elements (xo,x±, . . . , x 2n ), with X2 n = xq, and 
such that {x2i,X2i+\) G R and (x2i+i, X2i+2) G S, for < i < n. 

Lemma 3.13. Let R and S be two binary relations that are preserved by a totally symmetric 
function f n of arity n > 1. If there is an alternating closed walk on R and S of length 2n, 
then R n S' 1 + 0. 

Proof. Since (x 2 j,x 2i+ i) G R for < i < n, we have (y,z) G R, for y = f n (xo,x 2 , ■ ■ ■ ,£271-2) 
and z = / ra (xi, X3, . . . , X2n-i)- Similarly, since (a^i+i, £21+2) G S for < i < n, we have 
(y',z f ) G S, for y' = f n (xi,x 3 , . . . ,x 2n -i) and z' = / n (x 2 ,x 4 , . . . ,x 2n ). Note that 

V = fn(xo,X 2 , ■■■ , X 2n - 2 ) = fn(x 2 ,X A , . . . , X 2n = Xq) = z' , 

and that z = y' . Therefore, (y, z) G R and (z, y) G 5, hence (y, z) G i? Pi 5 . □ 

An orbital of T is an orbit of Aut(T) acting component- wise on ordered pairs of elements 
of r. Every structure always has the trivial orbital {(x,x) \ x G T}. 

Lemma 3.14. Let T be a model- complete core with finitely many orbitals and a totally sym- 
metric polymorphism f n of arity n for alln > 1. Let X be an equivalence class of a first-order 
definable equivalence relation on T. If a m (X) = X for some a G Aut(T) and m > 0, then 
a(X) = X. 

Proof. By assumption, there exists a smallest integer r > 1 such that a r (X) = X. Let x G X, 
and for fc£Z, let O(k) be the orbital of F containing the tuple (x, a k (x)). Then, we have the 
inclusion {{a n {x), a n+k (x)) | n G Z} C 0(k). 

Since T has finitely many orbitals, we can find integers < / < k such that 0(k) = 0(1). 
In fact, we can do this while ensuring that I = r — 1 (mod r). Note that (a l (x), a t+l+1 (x)) G 
0(1 + 1), and that (a l+l+l (x), a t+1 (x)) G 0(— I), for all i. In particular, the following sequence 
is an alternating closed walk on 0(1 + 1) and O(-l) = 0(l)~ l of length 2(k — I). 

(x,a l+1 (x),a 1 (x),a l+2 (x) 1 a 2 (x), . . . ,a k ~ l ~ 1 (x),a k (x),x) 

As r is a model-complete core, each orbital is primitive positive definable in T, and hence 
preserved by f n for each n > 1. From Lemma l3.13( it now follows that the orbitals 0(1 + 1) 
and 0(1) intersect, and therefore they must be equal. This implies that the tuples (x, q' +1 (x)) 
and (x, a l (x)) are in the same orbital, so there exists an automorphism /3 of T which fixes x 
and maps a l+1 (x) to a l (x). Since /3 fixes x, we have (3(X) = X, and from the choice of I, 
we have a l+l (X) = X. It follows that a l (x) = (3(a l+1 (x)) G X, so a l (X) = X, and hence 
a r ~ 1 (X) = X as well. Due to the choice of r, this is only possible if r = 1, so we conclude 
that a preserves X. □ 

We are now ready to prove the main result of this section. 

Proposition 3.15. Let T be a transitive sub- exponential model- complete core with totally 
symmetric polymorphisms of all arities. Then T has size 1, or it is infinite and primitive. 
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Proof. Assume that V has size at least 2. Let Eq and E\ denote the congruence of Aut(r) with 
equivalence classes of size 1, and the congruence with a single equivalence class, respectively. 
Let E be an inclusion-maximal congruence from the set of all congruences different from E\. 
Existence of E follows from T having finitely many first-order definable equivalence relations, 
and the existence of Eq. We want to show that E must in fact be Eq from which it follows 
that Aut(r) is primitive. 

Let D = D(T). By D/E we will denote the set of equivalence classes of E. For x € D, 
let x[E] denote the equivalence class of E containing x, and for a G Aut(r), let a/E denote 
the function on D/E which maps x[E] to a(a;)[-E] for each equivalence class x[E] of E. (It 
follows from E being a congruence that a/E is well-defined.) We then have that Aut(r)/.E := 
{a/E | q e Aut(r)} is a permutation group on D/E. 

Let H = Aut(T)/E. For each equivalence class Xi £ D / E, choose Xi € D such that Xi[E] = 
X{. Let {Xi, . . . , X n } and {Yi, . . . , Y n } be two n-subsets of D/E such that a{{x\, . . . , x n }) = 
{yi,---,Vn} for some a G Aut(r). Then, { a/E(Xx ),..., a/E(X n )} = {Y 1 ,...,Y n }, so the 
number of orbits of n-subsets of H is sub-exponential whenever T is sub-exponential. 

Assume that H is finite. Pick two distinct equivalence classes X and Y of E. Since T is 
transitive, we can find a € Aut(r) such that a(X) = Y. But if H is finite, then a m (X) = X 
for some m > 0, so a(X) = X by Lemma 13. 14} a contradiction. Therefore H must be 
infinite. Congruences of H are in one-to-one correspondence with the congruences of Aut(r) 
containing E. As the latter are precisely E and E\, it follows that H is primitive. 

It now follows from Theorem 13.11 1 that H is highly set-transitive, and so from Theorem l3.12l 
that H is isomorphic to the automorphism group H' of a structure with a first-order definition 
in (Q; <). The group H' either has one or two non-trivial orbitals. If it only has one non- 
trivial orbital, then so does H, hence for any two distinct equivalence classes X and Y of E, 
there is an automorphism a of V such that a(X) = Y and a(Y) = X. Again by Lemma l3.14[ 
it follows that a(X) = X, a contradiction. So H' has two non-trivial orbitals, one of which 
is the order < on Q. Via the isomorphism, H thus has a non-trivial orbital < which is a 
linear order on the equivalence classes of E. Let R be a binary relation on T defined by 
R(x,y) iff x[E] < y[E]. For a £ Aut(r), we have that a/E preserves <, so a preserves 
R. Assume now that the equivalence classes of E have size greater than 1. We can then 
encode a sequence in {0, l} n as a set {x\,y%, ■ ■ ■ , x n , y n } C T: choose Xi, yi so that R(yi,Xi + \) 
for 1 < i < n, encode a value in position i by enforcing E(xi,yi), ^ and encode a 
value 1 by enforcing R(xi,yi). The relations E, ^, and R are all preserved by Aut(r), so if 
two 2n-subsets encodes distinct sequences, then they must be contained in distinct orbits of 
2n-subsets. Hence, the number of orbits of 2n-subsets is greater than or equal to 2 n , which 
contradicts V being sub-exponential. So the equivalence classes of E are of size 1, i.e., E = Eq, 
and r is primitive. □ 

3.5. The Primitive Case. Assume that T is a transitive sub-exponential model-complete 
core with totally symmetric polymorphisms of all arities. If V is of size greater than 1, then it 
is infinite and primitive (Proposition 13.15]) . so it is highly set-transitive (Theorem 13. lip and 
hence isomorphic to a structure r' with a first-order definition in (Q; <) (Theorem 13. 12j) . To 
prove the first part of Theorem [3T71 we have to show that V is in fact first-order interdefinable 
with (Q; <). If the binary relation < is an orbital of V , then < is first-order definable in T' by 
Theorem 13. 1[ and interdefinability follows. Otherwise, the smallest orbital of V that contains 
< also contains a pair (x, y) such that x > y. It follows that T' has an automorphism a such 
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that a(x) = y and a(y) = x, i.e., a 2 (x) = x. Since T' has totally symmetric polymorphisms 
of all arities, we can apply Lemma 13.141 to deduce a(x) = x, a contradiction. 

To prove the second part of Theorem 13. 7^ we show that any structure with a finite relational 
signature and a first-order definition in (Q; <) has an efficient sampling algorithm. A rela- 
tional structure T is homogeneous (or ultrahomogeneous) if every isomorphism between finite 
induced substructures of T can be extended to an automorphism of V. A structure T with 
signature r admits quantifier- elimination if every first-order T-formula is over T equivalent to 
a quantifier-free T-formula. An o;-categorical structure admits quantifier-elimination iff it is 
homogeneous [TO] . Let [n] := {1, . . . ,n}. 

Theorem 3.16. Every structure T with a finite relational signature r and a first- order defi- 
nition in (Q; <) has an efficient sampling algorithm. On input n, the output of the algorithm 
is a structure of size n. 

Proof. The structure (Q; <) is homogeneous, and hence admits quantifier-elimination. Let 
<pl, . . . , 4>k be first-order definitions of the relations R\, . . . , i?£ of T, written in quantifier- free 
conjunctive normal form. For a given n, we compute a finite r-structure B with domain [n] as 
follows, where [n] is viewed subset of D(F) = Q. For each i, let Rf be the mj-ary relation 
over the domain [n] that contains all m^-tuples that satisfy (pi (where rrii is the number of 
free variables of (pi). Since (pi is of constant size in n, the relation Rf can be computed in 
time 0(n mi ). The resulting structure B = ([n];Rf, . . . ,R^) is clearly a substructure of T. 
Moreover, if A is a finite r-structure with n elements, and s : A — > Q is a homomorphism 
from A to T, then by high set-transitivity of T there is an automorphism a of T such that 
a(s(A)) C [n]. Hence, A homomorphically maps to T if and only if A homomorphically maps 
to B. □ 

4. Beyond Sub-Exponential Growth: Examples 

In this section we illustrate with a couple of examples the possibility of extending our 
tractability result beyond sub-exponential structures. All examples of w-categorical structures 
with totally symmetric polymorphisms of all arities that we are aware of have a first-order 
interpretation over the structure (Q; <). Interpretations are a central concept from model 
theory, which we briefly recall in the following. Let a and r be relational signatures, A a 
a-structure, and V a r-structure. A d-dimensional (first-order) interpretation I of V in A 
consists of (cf. [T7] ) 

(1) a a- formula dj{x\, . . . ,2^); 

(2) for each atomic r-formula (j)(yi, . . . , y m ), a cr-formula (f>i(x~i, ■ ■ ■ , x m ), where x%, . . . , x m 
are disjoint d-tuples of distinct variables; and 

(3) a surjective map // : d/((L>(A)) d ) T; 

such that for all atomic r-formulas <p and all en € dj(D(A) d ), <p(fj(ai), . . . , fi(a m )) holds in 
r if and only if 4>i(a~i, . . . , a m ) holds in A. 

Our interest in interpretations also stems from the following result. 

Lemma 4.1. Every structure T with a finite relational signature r and a d-dimensional 
interpretation in (Q; <) has an efficient sampling algorithm. On input n, the output of the 
algorithm is a structure of size at most (dn) d . 

Proof. Let I be a ci-dimensional interpretation of T in (Q; <). On input n, we compute a 
finite r-structure B which is the induced substructure of T on the domain fj(dj([dn] d )). For 
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an m-ary R G r, we do the following. Let (pj be the interpretation in (Q; <) of the atomic 
formula R(yi, ■ ■ ■ , y m ), given in quantifier-free conjunctive normal form. We now evaluate 4>i 
on each sequence of d-tuples, a\, . . . ,a m G dj([dn] d ). The formula (f)j(ai,. . . ,a m ) is true iff 
the tuple (//(ai), . . . , fi(a m )) is in R B . Since <fi is of constant size in n, it follows that we can 
compute the relation R B in 0((dn) dm ) time. The signature r is finite, so there is a relation 
of highest arity, independent of n, which provides the upper bound on the time complexity 
of the algorithm. 

Next, let A be a finite r-structure with n elements, and assume that s : A — > T is a 
homomorphism. This implies A — > B as well: the image of A under s has at most n elements, 
b\,...,b n £ T. Let 01, . . . ,a n be tuples in di(Q d ) such that fi(di) = bi, and let g : s(j4) — > 
di(Q d ) be the function such that g{bi) = di for all i. Note that ai,...,a n contains at 
most dn distinct values of Q. By high set-transitivity of (Q; <), it follows that there is an 
automorphism a of (Q; <) such that G [dn] for each i and j, where di = {an, . . . , a^). 

Now for each relation symbol R G r, we have (xi, . . . , x m ) G R r iff 4>i(g(xi), . . . , g(x m )) holds 
in (Q; <) iff 0/(a(^(aJi)), . . . , a(<?(x m ))) holds in (Q; <), and this in turn is true if and only if 
(fl(a(g(xi))), . . . , fj(a(g(x m )))) G R B . It follows that fioaogosisa homomorphism from 
A to B. □ 

Below, we give a number of examples of structures with exponential growth that are inter- 
pretable in (Q; <), and have semi- lattice operations. 

Example 1. Let '<', '=', and '>' denote the usual inequality and equality relations on Q. 
Let Ti be the relational structure over Q 2 with binary relations R P:(T for p,a G {<,=,>}, 
where R p ,<r((xi,yi), (^2)2/2)) is defined by p(x\,X2) /\a(yi,y2)- This structure is exponential. 
To obtain a lower bound on the growth rate of T\, pick an n-subset A = {a\, . . . , a n } C Q 2 , 
and assume that the projection of A on the second component contains exactly k distinct 
values. For 1 < i < k, let Ai denote the number of elements that have the ith largest second 
component value. Then, A\ + • ■ ■ + A^ = n determines a composition ofn, i.e., an expression 
for n as an ordered sum of positive integers. Two sets ,4,6 C Q 2 which determine different 
compositions must be in different orbits. Thus, the number of orbits of n-subsets of Q 2 is at 
least 2™" 1 , the number of compositions of n. 

Hence, Theorem 12.51 does not apply to Ti. Instead, tractability can be inferred as follows. 
The structure Ti has a two-dimensional interpretation I in (Q; <). The formula dj is always 
true, // is the identity on Q 2 , and the interpretations of R and S are as given above. Fur- 
thermore, it is easy to verify that Ti is invariant under the semi-lattice operations given by 
(component-wise) min and max. Tractability of CSP(Fi) now follows from Theorem 12.41 and 
Lemma 14.11 

For any d > 2, this example can be generalized to a structure with 3 d d-ary relations, and 
with a cZ-dimensional interpretation in (Q; <). The CSP of each such structure is polynomial- 
time solvable. □ 

Example 2. The age of a structure T is defined as the class of all finite structures isomorphic 
to a substructure of T. Let (R, S) be a signature with two binary relation symbols, and let 
T = {Ti, T2, T3, T4} be the set of structures in Fig. HI where the tuples of R (S) are given by 
the solid (dashed) arrows. Let C be the class of all finite structures with signature (R, S) for 
which every three-element substructure is isomorphic to a structure in T. It can be shown 
that C is an amalgamation class so that its Fraisse limit exists (cf. Theorem 6.1.2 in |17j). 
This is the up to isomorphism unique countable homogeneous structure with age C. 
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Figure 3. Four relational structures with signature (R,S); the relations R 
and S are given by the solid and dashed arrows, respectively. 

The following describes a relational structure T2 with age C which can be verified to be 
homogeneous. It follows that T2 is isomorphic to the Frai'sse limit of C. Let T2 = (Q 2 ; R, S), 
where R((x\, yi), (x 2 , V2)) is the relation x\ = x 2 A y\ < y 2 , and S((xx, yi), (x 2 , 2/2)) is the 
relation x\ < x%. The growth rate of V 2 can be bounded as in the previous example, and 
here it turns out that the number of orbits of n-subsets is precisely 2 n_1 . The structure 
T 2 also has a two-dimensional interpretation in (Q; <) and semi-lattice polymorphisms given 
by (component-wise) min and max, so tractability follows once again from Theorem 12.41 and 
Lemma 14.11 □ 

Example 3. Let T3 := (U U V; M, <) be the following relational structure. The domain UUV 
is the disjoint union of two copies of Q. The binary relation M defines a perfect matching 
between the elements of U and the elements of V, and the binary relation < defines a dense 
linear order on U U V such that u < v for all u € U and v £ V, and for vi,v 2 £ V, we have 
v\ < v 2 iff u\ < u 2 for the elements ui,u 2 G U with (u\,v\), (u 2 ,v 2 ) G M. 

The structure T3 is invariant under the semi-lattice operations given by min and max 
defined with respect to the order < on U U V. It has two orbits and Aut(r3) is isomorphic 
(as an abstract group) to Aut((Q; <)). By Theorem 13.91 this implies that T3 does not have 
sub-exponential growth. But T3 has a 2-dimensional interpretation I in (Q; <), so CSP(r3) 
is polynomial-time solvable: let di(x,y) be the formula x ^ y, and let fi(x,y) be the copy of 
x in U if x < y and the copy of x in V if x > y. The matching M on (xx, yi) and (x 2 ,y 2 ) is 
interpreted by the formula x\ = x 2 /\x\ < y\Ax 2 > y 2 and the order < on (a?i, y\) and (x 2 , y 2 ) 
is interpreted by the formula (x\ < y\ A x 2 > y 2 ) V {x\ < y\ A x 2 < y 2 A x\ < x 2 ) V {x\ > 
yi A x 2 > y 2 A xi < x 2 ). □ 

Every finite structure B has an interpretation / in (Q; <), in fact, even in (Q; =). Let n = 
\D\ and the dimension of the interpretation be d = 2n. The formula <9/(xi, . . . , x n , x[, . . . , x' n ) 
is true if and only if for exactly one i it holds that x^ = x[. Equality is interpreted by the 
formula 

n 

4>={xi,.. .,ar n ,s'i, ■■-,x' n ,yx,.. .,y n ,y[,.. . ,y' n ) = (/\(xi = sc-) o (yi = y[)). 

i=i 

It is now straightforward to write down first-order formulas <f)R that interpret the relations of 
B. When R is fc-ary, then the formula (f>R is a disjunction of conjunctions with 2nk variables 
xi,i, ■ ■ .,x 1>n ,x' lil , . . -,x' ln , . . . ,x k ,i, ■ ■ -,x kjn ,x' kl , . . -,x' kn . For each tuple (ii, ...,t k ) G R, 

the disjunction contains the conjunct Aj=i( x j,^ = x 'jtj)- 
The examples in this section suggest the following question. 

Question 4.1. Is it true that all lo '-categorical relational structures with totally symmetric 
polymorphisms of all arities have a first-order interpretation over (Q; <)? 
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5. Concluding Remarks 

In this article we prove that constraint satisfaction problems for templates T where the 
number of orbits of n-subsets of T grows sub-exponentially in n can be solved in polynomial 
time when V has a semi-lattice polymorphism. In fact, we showed the stronger result which 
only requires the existence of totally symmetric polymorphisms of all arities, instead of re- 
quiring the existence of a semi-lattice polymorphism. This algorithmic result can be showed 
in two stages: 

(1) In the first stage, we reduce CSP(r), for structures V with totally symmetric poly- 
morphisms of all arities, to solving certain uniform finite domain CSPs, and to the 
task to find an efficient sampling algorithm for Y. 

(2) In the second stage, we classify sub-exponential structures V with totally symmetric 
polymorphisms of all arities, and use the classification to verify that there always 
exists an efficient sampling algorithm for T. 

The reduction presented in the first stage crucially relies on the fact that when V has totally 
symmetric polymorphisms of all arities, then for all finite induced substructures S of T the 
set structure of S homomorphically maps to T (Lemma I2.3[) . We want to remark that this 
connection has a converse when V is w-categorical. 

Lemma 5.1. Let T be an uj- categorical structure over a finite relational signature. Then 
r has totally symmetric polymorphisms of all arities if and only if V(S) —¥ T for all finite 
substructures 5CI. 

Proof. The forward direction was proved in Lemma 12.31 The remaining direction can be 
proved using a common technique for constructing homomorphisms to w-categorical struc- 
tures. Given an arbitrary positive integer n, we want to produce an n-ary totally symmetric 
polymorphism / of T. The idea of the proof is as follows: let li,h, ■ ■ ■ be an enumeration of 
the elements of D = D(T), and let Lk = . . . ,lk}- For each k > 1, let Fk be the set of 
homomorphisms from V(T[Lk]) to T. Introduce an equivalence relation ~ on Fk by defining 
/ ~ g iff / = aog for some automorphism a of V. Let Fk denote the set of equivalence classes 
of Ffc under ~. 

Arrange the elements of Ufc>i ^k m ^ j ° a forest containing at least one infinite tree: each 
fi £ F\ is defined to be the root of a separate tree, and for each k > 1, and fk € Fk, define 
the parent of fk € Fk to be the equivalence class containing the restriction of fk to the non- 
empty subsets of Lk-i- This definition is independent of the choice of representative in fk, 
so each equivalence class in Fk, k > 1, has precisely one parent. Since T is w-categorical, it 
follows that there are finitely many equivalence classes for a fixed k. Hence, there are finitely 
many trees and each tree is finitely branching in each node. By assumption, Fk is non-empty 
for each k > 1, so some tree has unbounded height. Now, Konig's tree lemma implies the 
existence of an infinite path /i,/2, • • • in some tree. Assume that there are representatives 
/i £ Zii /2 € fi, ■ ■ ■ , fk € fk, such that fk-i is the restriction of fk to the non-empty subsets of 
L>k-\. We show that this path can be extended indefinitely: choose gk+i £ fk+i arbitrarily and 
let gk be its restriction to the non-empty subsets of Lk- Then, there exists an automorphism 
a such that fk = a o g k . It follows that fk is the restriction of a o g fe+1 to the non-empty 
subsets of Lk, hence we can define fk+i := a o gk+i- 

Now, for any n-tuple (x±, . . . , x n ) over D, define f{x\, . . . , x n ) = f m ({xi, ■ ■ ■ , x n }), where m 
is an any integer such that {x\, . . . ,x n } C L m . By the construction of the sequence /i, /2, • • • , 
the function / is a well-defined totally symmetric n-ary function on D. To verify that / is a 
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polymorphism of T, let R r be an r-ary relation, and t\, . . . ,t n € R r . Let L/j = {ii [«],... , t n [*]}> 
for 1 < i < r. Assume without loss of generality that m has been chosen large enough so 
that (Ji=i U{ C L m . Then f(ti[i], . . . , t n [i]) = f m (Ui) for all i. By definition of the set 
structure, we have (U\, . . . , U r ) £ iifW-M). 

Since / m is a homomorphism, we conclude that 
(fm(Ui), ■ ■ ■ , fm(U r )) G -R r , so / is indeed a polymorphism. □ 

Because of the general applicability of the algorithmic approach in Section [21 and because 
of the fact that the known structures of exponential growth that have totally symmetric 
polymorphisms seem to be well-behaved (see Section |3J), we make the following conjecture. 

Conjecture 5.2. Let T be an to- categorical structure with finite relational signature and 
totally symmetric polymorphisms of all arities. Then CSP(r) can be solved in polynomial 
time. 
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